Learning using Local Membership Queries under Smooth Distributions

Authors

  • Pranjal Awasthi
  • Varun Kanade
Abstract

We introduce a new model of membership query (MQ) learning, where the learning algorithm is restricted to query points that are close to random examples drawn from the underlying distribution. The learning model is intermediate between the PAC model (Valiant, 1984) and the PAC+MQ model (where the queries are allowed to be arbitrary points). Membership query algorithms are not popular among machine learning practitioners. Apart from the obvious difficulty of adaptively querying labellers, it has also been observed that querying unnatural points leads to increased noise from human labellers (Lang and Baum, 1992). This motivates our study of learning algorithms that make queries that are close to examples generated from the data distribution. We restrict our attention to functions defined on the n-dimensional Boolean hypercube and say that a membership query is local if its Hamming distance from some example in the (random) training data is at most O(log(n)). We show three positive learning results in this model: (i) The class of O(log(n))-depth decision trees is learnable under a large class of smooth distributions using O(log(n))-local queries. (ii) The class of polynomial-sized decision trees is learnable under product distributions using O(log(n))-local queries. (iii) The class of sparse polynomials (with coefficients in R) over {0, 1}^n is learnable under smooth distributions using O(log(n))-local queries.

The author is supported in part by the National Science Foundation under grants CCF-1116892 and IIS-1065251. Part of this work was done when the author was visiting Microsoft Research, New England.

The author is supported by a Simons Postdoctoral Fellowship. Part of this work was performed while the author was at Harvard University supported by grant NSF-CCF-09-64401.
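The locality condition stated in the abstract can be made concrete with a small sketch. The code below is only an illustration, not the authors' algorithm: the names `hamming`, `locality_radius`, `is_local`, and `LocalMQOracle` are hypothetical, and the constant `c` hidden in the O(log(n)) bound is an assumed parameter.

```python
import math
from typing import Callable, Sequence, Tuple

Point = Tuple[int, ...]  # a point of the Boolean hypercube {0, 1}^n


def hamming(x: Point, y: Point) -> int:
    """Number of coordinates in which x and y differ."""
    if len(x) != len(y):
        raise ValueError("points must have the same dimension")
    return sum(a != b for a, b in zip(x, y))


def locality_radius(n: int, c: float = 1.0) -> int:
    """Assumed radius r = ceil(c * log2(n)); c is a hypothetical constant."""
    return max(1, math.ceil(c * math.log2(n)))


def is_local(query: Point, sample: Sequence[Point], r: int) -> bool:
    """True if the query is within Hamming distance r of some sample point."""
    return any(hamming(query, x) <= r for x in sample)


class LocalMQOracle:
    """Membership oracle that answers only r-local queries (illustrative)."""

    def __init__(self, target: Callable[[Point], int],
                 sample: Sequence[Point], r: int) -> None:
        self.target = target
        self.sample = list(sample)
        self.r = r

    def query(self, x: Point) -> int:
        # Reject queries that are far from every random training example.
        if not is_local(x, self.sample, self.r):
            raise ValueError("query is not r-local")
        return self.target(x)
```

With n = 8 and c = 1 the assumed radius is 3, so a learner holding the single example (0, ..., 0) may query any point with at most three ones, while a point with four ones is rejected.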

Similar resources

Learning Using Local Membership Queries

We introduce a new model of membership query (MQ) learning, where the learning algorithm is restricted to query points that are close to random examples drawn from the underlying distribution. The learning model is intermediate between the PAC model (Valiant, 1984) and the PAC+MQ model (where the queries are allowed to be arbitrary points). Membership query algorithms are not popular among mach...

On Boosting with Polynomially Bounded Distributions

We construct a framework which allows an algorithm to turn the distributions produced by some boosting algorithms into polynomially smooth distributions (w.r.t. the PAC oracle’s distribution), with minimal performance loss. Further, we explore the case of Freund and Schapire’s AdaBoost algorithm, bounding its distributions to polynomially smooth. The main advantage of AdaBoost over other boosti...

Simple Learning Algorithms for Decision Trees and Multivariate Polynomials

In this paper we develop a new approach for learning decision trees and multivariate polynomials via interpolation of multivariate polynomials. This new approach yields simple learning algorithms for multivariate polynomials and decision trees over finite fields under any constant bounded product distribution. The output hypothesis is a (single) multivariate polynomial that is an ε-approximation of t...

Learning Unions of Tree Patterns Using Queries

This paper characterizes the polynomial time learnability of TP_k, the class of collections of at most k first-order terms. A collection in TP_k defines the union of the languages defined by each first-order term in the set. Unfortunately, the class TP_k is not polynomial time learnable in most learning frameworks under standard assumptions in computational complexity theory. To overcome this co...

Learning Minor Closed Graph Classes with Membership and Equivalence Queries

The paper considers the problem of learning classes of graphs closed under taking minors. It is shown that any such class can be properly learned in polynomial time using membership and equivalence queries. The representation of the class is in terms of a set of minimal excluded minors (obstruction set).

Journal:
  • CoRR

Volume: abs/1211.0996

Publication date: 2012